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Abstract 

The tail of the distribution of a sum of a random number of independent and 
identically distributed nonnegative random variables depends on the tails of the 
number of terms and of the terms themselves. This situation is of interest in the 
collective risk model, where the total claim size in a portfolio is the sum of a random 
number of claims. If the tail of the claim number is heavier than the tail of the claim 
sizes, then under certain conditions the tail of the total claim size does not change 
asymptotically if the individual claim sizes are replaced by their expectations. The 
conditions allow the claim number distribution to be of consistent variation or to be 
in the domain of attraction of a Gumbel distribution with a mean excess function 
that grows to infinity sufficiently fast. Moreover, the claim number is not necessarily 
required to be independent of the claim sizes. 
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1 Introduction 



In the collective risk model in actuarial risk theory, the total claim amount in 
a portfolio is assumed to be a random sum, 



N 



where N, the claim number, is a nonnegative integer-valued random variable 
and Xi,X 2 , . . ., the claim sizes, form a sequence of independent and identi- 
cally distributed (iid) nonnegative random variables, independent of N; see 
e.g. Kaas et al. (2001, chapter 3). The tail of the compound distribution de- 
pends on the tails of the claim number and claim size distributions. If both 
distributions are light-tailed, that is, if E[e tAr ] < oo and E[e* Xl ] < oo for some 
t > 0, saddlepoint approximation techniques can be used to analyze the tail 
of Sn (Beard et al., 1984; Jensen, 1995). If the individual claim distribution 
is subexponential and if the claim number distribution is light-tailed, then 

Pr[S N >x}~ E[N] Pr[Xi > x], x -> oo (1.1) 

(Embrechts et al., 1997, Theorem A3. 20); see Denisov et al. (2007) for one- 
sided versions of eq. (1.1) under milder conditions. 

In this paper we concentrate on the converse case when the tail of Sn is dom- 
inated by the tail of N. This setting is relevant, for instance, for earthquake 
insurance, featuring a potentially large number of bounded claims, or in the 
case of individual unobserved heterogeneity, as well as in queueing theory, see 
Section 6. We try to answer to the following question raised in Schmidli (1999): 
When does 

Pr[S N > x] ~ Pr[E[Xi]iV >x], x^oo (1.2) 
hold? We consider a slightly more general framework because we do not nec- 
essarily assume the claim number to be independent of the claim sizes. 

Theorems 1.3 and 1.4 in Stam (1973) shed light on the difference between the 
two approximations in eqs. (1.1) and (1.2): if the claim number and claim size 
distributions have finite expectations and regularly varying tails, that is, if 
there exist a > 1 and (5 > 1 such that 

lim > X J1 = y -« l im > fg] = y-0 

x^oo p r [jv > x] Pr[Xi > x] 

for y > 0, then, provided iV and {Xj}j>! are independent, a > f3 implies (1.1) 
while (3 > a implies (1.2). In other words, which of the two approximations 
(1.1) or (1.2) is valid depends on which of the two distributions has the heavier 
tail. Approximation (1.2) will therefore arise when the tail of N is heavier than 
the tail of X. 
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The main contribution of this paper consists of the four theorems in Sections 3 
and 4 providing different sets of sufficient conditions for eq. (1.2) to hold. 
The two theorems in Section 3 concern the case where the claim number 
distribution is of consistent variation and the claim size distribution has a 
finite moment of order r for some r > 1. The two theorems in Section 4 treat 
the case where the claim number distribution is in the Gumbel domain of 
attraction and the moment generating function of the claim size distribution 
is finite in a neighbourhood of the origin. An important factor is the speed 
at which the mean excess function E[N — x \ N > x] tends to infinity. In 
the special case where N is a discretized Weibull random variable with tail 
function Pr[N > x] ~ exp(— x 13 ) as x — + oo and shape parameter < (5 < 1, 
the conditions for Theorems 4.1 and 4.2 translate into (3 < 1/3 and (3 < 1/2, 
respectively. The exponent 1/2 marks the lower boundary on the speed of 
growth of the mean excess function for which eq. (1.2) can still be expected 
to hold; see Section 5. In Theorems 3.1 and 4.1, the dependence between the 
claim number and the claim sizes can be arbitrary. 

The outline of this paper is as follows: Section 2 presents preliminaries on tail 
asymptotics and large deviations. Statements and proofs of our main results 
are to be found in Sections 3 and 4. Section 5 gives a short discussion. Section 6 
concludes the paper with applications to insurance and operations research. 

The following notations and assumptions are in force throughout the paper. 
Let N be a nonnegative integer- valued random variable and let Xi,X 2 , ... be 
a sequence of iid nonnegative random variables with finite mean /i = E[Xi] 
and variance a 2 = Vai[Xi\. The claim number N is not necessarily assumed 
to be independent of {Xj}^. Denote S = and S n — Xi + ■ — h X n for an 
integer n > 1. For real y, we denote by \y~\ the smallest integer greater than 
or equal to y; similarly, [_y\ is the greatest integer smaller than or equal to y. 
For two positive functions / and g defined in a neighbourhood of infinity, we 
write f(x) ~ g(x) as x — > oo provided lim^oo f(x)/g(x) = 1 and we write 
f(x) = o(g(x)) as x — > oo provided lirn^oo f(x)/g(x) = 0. The indicator 
function of an event A is denoted by 1(A). 



2 Preliminaries 



2.1 Consistent variation 



A distribution function F is said to have a consistently varying tail, notation 
F e C, if 

F { xv i 

limlimsup =- = 1, (2.1) 

yn x->co F Fix) ' v ; 
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where F — 1 — F. This regularity property was first introduced in Cline 
(1994) and called "intermediate regular variation". The concept of consistent 
variation has been proven useful in various papers in queueing systems and 
ruin theory (Cai and Tang, 2004; Kaas and Tang, 2005; Ng et al., 2004; Tang, 
2004). 

A distribution function F is said to have a regularly varying tail, notation 
F e TZ, if there exists a > such that 

F(xy) 

lim = y~ a , y>0, 

F(x) 

or equivalently such that 

F(x) = x~ a L (x), x> 0, 
where L is a slowly varying function, that is, a function satisfying 

lim y>0. 

In Cline and Samorodnitsky (1994), it is shown that the class C strictly con- 
tains the class TZ. In addition the class C is itself strictly contained in the class 
of distribution functions with a dominatedly varying tail, that is, the class V 
of distribution functions F such that 

lim sup 5^ < oo (2.2) 

x^oo F[X) 

for all (or, equivalently, for some) y G (0, 1). 
For a distribution function F, define 

F,(y)=liminf^M, y > 0, 



and 



(Xp = lim < oo, 

y->oo log y 



with — logO = oo. In the terminology of Bingham et al. (1987, Proposi- 
tion 2.2.5), the quantity ctp is the upper Matuszewska index of the function 
1/F. It is an upper bound for the upper order, pp, of 1/F, defined by 

p^ = hmsup . (2.3) 

x^oo log X 

By Bingham et al. (1987, Theorem 2.1.7), a distribution function F belongs 
to V if and only if a F < oo. But then p F < a F < oo. As F(x) = x~ p ^ with 
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lim sup^^ p(x) = pf, we obtain that for every p > pp, 

x -p = o(F(x)), x -> oo. (2.4) 

In this sense, distributions of dominated variation (and thus also those of 
consistent variation) are heavy-tailed. 

2. 2 Domains of attraction 



Let N be a nonnegative integer-valued random variable. Assume that N is 
in the maximal domain of attraction of an extreme value distribution. Be- 
cause the distribution of N is discrete, its attractor must be a Frechet or the 
Gumbel distribution, that is, its attractor cannot be a Weibull extreme value 
distribution. 

On the one hand, the distribution of N is in the domain of attraction of the 
Frechet distribution with shape parameter a > if and only if 

Pr[iV >x}= x~ a L (x) (2.5) 

where L is a slowly varying function. This case was treated in Stam (1973) and 
in Fay et al. (2006); see Section 6. A sufficient condition for (2.5) is the von 
Mises type condition lim^oo nPr[iV = n]/Pr[iV > n] = a (Anderson, 1980). 
In turn, regular variation implies consistent variation, that is, (2.5) implies 
(2.1) with F(x) = Pt[N < x], and it is merely the latter concept which will 
be used later on. 

On the other hand, the distribution of N is in the domain of attraction of the 
Gumbel distribution if and only if there exists a positive function a defined in 
a neighbourhood of infinity such that 

PrLV > x + ya(x)} „ _ 
lim 1 - rAr — V^ 1 = e ~\ V e K. 2.6 
—oo p r [N > x ] 



X 



The function a is necessarily asymptotically equivalent to the mean excess 
function, a(x) ~ E[iV — x \ N > x] as x — > oo, and it is self- neglecting, that 



is, 



a(x) , a(x + ya(x)) m , „. 

lim = 0, lim / x v ;; =1, y eR. 2.7 

x x ^°° a(x) 



x^oo 



The tail function of iV admits the following useful representation: 

Pr[N >x] = c(x) expf - [* ^\dt] (2.8) 

\ Jo a(t) J 
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where lim^oo c(x) = c G (0, oo) and lim^oo g (t) = 1. By Anderson (1980), 
a sufficient condition for N to be in the Gumbel domain of attraction is that 
q n — > oo and q n+ i — q n — > as n — > oo, where g n = Pr[iV > n]/ Pr[iV = n]. 



Example. Let iV be a discretized Weibull variable, that is, N = [Y\ where 
Pt[Y > y] = exp(— y 13 ) for y > 0, with parameter /3 > 0. If (5 < 1, then 
Pr[JV > y] ~ Pr[F > y] as y -»• oo and (2.6)-(2.8) hold with a(x) = p~ l x l ~ p . 



2.3 Large deviations 



In order to analyse the tails of the compound sum SV, we need bounds on 
the probability of large deviations of S n . The first bound is a special case of 
Lemma 2.3 in Tang (2006). 

Lemma 2.1 // E[X[] < oo for some r > 1, then for each 7 > and q > 0, 
there exist positive numbers v and C = C (17,7) irrespective to x and n such 
that for all x > and n = 1,2,..., 

Pr[S n — nji> x] < nPr [X\ — fJ> > vx] + Cx~ q . 



The second result is a simple consequence of Cramer's theorem on large de- 
viations; see Feller (1971, equation (7.30), p. 553) or Baltrunas et al. (2004, 
Lemma 2.2). 

Lemma 2.2 7/E[e <Xl ] < 00 for some t > 0, then for any sequence a n satis- 
fying a n /n 1 / 2 — > 00 and a n /n — > as n — > oo ; 

Pr[S n > nfi + a n ] = expf -J-^»{1 + (1)} j , 



Pr[S n <nfi- a n ] = exp ( -7^— {1 + o(l)} 



2a 2 n 
2a 2 n 



3 Main results: Consistent variation 



In this section, we treat the case where the tail of the distribution function of 
iV is of consistent variation, see eq. (2.1). Theorem 3.1 states a simple sufficient 
condition on the common distribution of the Xj for eq. (1.2) to be valid for 
arbitrary dependence structure between N and {Xi}. In case of independence, 
a weaker condition suffices, see Theorem 3.2. The results crucially rest on a 
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large-deviations result by Tang (2006, Lemma 2.3), reproduced as Lemma 2.1 
above, and which was kindly pointed out to us by an anonymous Referee. 

Theorem 3.1 If the tail of the distribution of N is of consistent variation, if 
E[X[] < oo for some r > 1 and if 

rrPrLYx > x] = o(Pr[N > x}), x -> oo, (3.1) 

then Pt[Sn > x] ~ Pr[iV > x/fj] as x — > oo. 



Proof. Since Pr[Sjv > x] — Pt[Sn/^ > x//i], we can without loss of gener- 
ality assume that /x = 1. Fix < e < 1. For x > 0, 

Pr [iSjv > x] = Pr[S N > x, N < (1 - e)a:] + Pr[SW > x, iV > (1 — e)x], (3.2) 

so that, on the one hand, 

Pr[S N >x}< Pr[S , L (i_ e)xJ > x] + Pr[iV > (1 - e)x], (3.3) 
and, on the other hand, 

Pr[S N > x] > + Pi[S Kl+e)x - ] > x, N > (1 + e)x] 

> Pr[iV > (1 + e)x] - Pr[S l(1+£)x] < x}. (3.4) 



Since the tail of N is of consistent variation, by eq. (2.1), 

u mUmiP f pr i Af> ; i+ fi 1= i. 

elO x^oo p r [AT > (1 - e)x\ 



(3.5) 



In view of eqs. (3.3), (3.4) and (3.5), Pr[<Sjv > x] ~ Pr[A^ > x] as x — > oo will 
follow if we can show that 



?AS[{i-e)x\ >x] = o(Pr[N >x\), x-^oo; (3.6) 
Pr[S l(1+£)x] <x]= o(Pr[iV > x}), x -> oo. (3.7) 

We first show eq. (3.6). Let ajv be the upper Matuszewska index of the function 
x i— > 1/ Pr[iV > x]. Since the tail of iV is of consistent variation, o;^ < oo (see 
subsection 2.1). Pick p > a N . By eq. (2.3), 

x -p = o(Vi[N > x}), x -> oo. (3.8) 

By Lemma 2.1, there exist positive numbers i> and C such that 
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Pr[S L( i_ £)a; j > x] < Pr[5 L(1 _ e)a; j - - e)x\ > ex] 

< [(1 - e)x\ Pr[Xx > vex] + C(£x)- p 



and thus 



PrfS^i-^j > x] < 1 -uexPrLYi > vex] Pr[iV > vex] C(ex)~ p 



Pr[iV > x] ~ ve Pt[N > vex] Pr[N > x] Py[N > x] ' 

Eq. (3.6) now follows from the above inequality combined with eqs. (2.2), 
(3.1), and (3.8). 

Next we show eq. (3.7). Since the Xi are nonnegative, by Chernoff's bound, 
there exists < a < 1, depending on e > 0, such that for all sufficiently large 

x, 

Pr[S [(1+£)x] <x]< a x . (3.9) 
This inequality in combination with (3.8) yields eq. (3.7). □ 

Theorem 3.2 Assume that N and {Xj}j>i are independent. If the tail of the 
distribution of N is of consistent variation, ifE[X{] < oo for some r > 1 and 
if one of the following two conditions holds: 

case E[N] < oo: 

Pr[Xi > x]= o(Py{N > x]), x -> oo; (3.10) 

case E[N] = oo: there exists q with 1 < q < r such that 

E[Nl(N<x)] , Q11 . 
h ^ P ,-MV.V -, <0 ° ; < 3 - U > 

then Pt[Sn > x] ~ Pr[iV > x//i\ as x — > oo. 



Proof. The proof is similar to that of Theorem 3.1. We just indicate the 
modifications. In view of eq. (3.2), it suffices to show that 

Pr[S N > x, N < (1 - e)x] = o(Pr[N > x]), x -> oo. (3.12) 

Since iV and {Xj}j>! are independent, 

L(l-e)xJ 

PrfSV > x, iV < (1 — e)x\ = ^ Pr[iV = k] Pr[S k — k > x — k\. 

Pick p > ajy, the upper Matuszewska index of the function x \— > 1/ Pr[iV > x]. 
By Lemma 2.1, there exist positive numbers v and C such that 
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Pr[S N > x, N < (1 - e)x\ 

L(l-e)xJ / 

< ^ Pr[iV = A;] [k Pr[Xi > u(x - fc)] + 



fc=0 



(x-ife) 



L(l-e)a:J £r 

< Pr [X x > evx 

k=0 



(ex)P 



■■ Pr[Xi > era]E[iVl{iV < (1 - e)x}] + 



(ex)?' 

In view of eq. (3.8), we only need to deal with the first term on the right-hand 
side of the previous display. We now invoke the additional condition. On the 
one hand, if E[N] < oo, then also E[iVl{iV < (1 - e)x}] < E[N] < oo, and 

PvlXi > evx] Pt\N > evx] Pr\Xi > evx] 

— — — — > X — > OO 

Pr[N > x] Pr[iV>a;] Pr[N > evx} 

where we used eqs. (2.2) and (3.10). On the other hand, if E[N] = oo, then, 
as Pr[Xi > x] < E[X[]/x r , 

Pr[X l > evx]E[Nl{N < (1 - s)x}\ = 0(x' r )0(x q Pr[N > (1 - e)x\) 

= o(Pt[N>x]), x — > oo, 

as required. □ 



Remark. Observe that if E[N] = oo and the function x h- > Pr[N > x] is 
regularly varying of index —a for some a G (0, 1], then eq. (3.11) holds, for, 
by Karamata's theorem (Bingham et al., 1987, Propositions 1.5.8 and 1.5.9a), 

E[iVl(iV < x)\ < [ X Pr\N > y]dy 
Jo 



~ x Pt[N > x]/(l - a) if < a < 1, 
= o(x q Pr[N > x]) if a — 1 < q. 



4 Main results: Gumbel domain of attraction 



Next, we treat the case where the claim number distribution is in the Gumbel 
domain of attraction (see subsection 2.2) and the moment generating function 
of the claim size distribution is finite in a neighbourhood of the origin. An 
important factor is the speed at which the auxiliary function a in (2.6) tends 
to infinity. If a(x)/x 2 ^ 3 — > oo as x — > oo, then (1.2) holds without further 
conditions (Theorem 4.1). If the function a is merely assumed to have a lower 
order larger than 1/2 in the terminology of Bingham et al. (1987, Section 2.2.2, 
p. 73), then (1.2) still holds provided the claim number is independent of the 
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claim sizes (Theorem 4.2). The order 1/2 marks the lower boundary on the 
speed of growth of a for which (1.2) can still be expected to hold true; see 
Section 5. 

Theorem 4.1 7/E[e 7Xl ] < oo for some 7 > and if (2.6) holds for a function 
a such that 

a ( x ) I A 1\ 

^^00, x^oo, (4.1) 
then Pt\Sn > x] ~ Pr[iV > x/fj] as x — > 00. 



Proof. Without loss of generality, assume fx = 1. Fix e > 0. Write 
Pr[S N > x]= Pr[S N > x, N < x-ea(x)}+Pr[S N > x,N > x-ea(x)\. (4.2) 
By Lemma 2.2, the first term on the right-hand side of (4.2) is bounded by 

Pi[S N > x, N < x - ea(x)] < Pr[Si x - ea ( x )\ > x] 



( e 2 a 2 (x) ri ^,,\ 

= exp (-^ . {l + o(l)} )' 



x — > 00. 



By (4.1), a 2 (x)/x is of larger order than Jq a l (i)dt, so that the right-hand 
side in the previous display must be o(Pr[iV > x\) as x — > 00. 

The second term on the right-hand side of (4.2) is bounded from above by 

Pr[S;v > x, N > x - ea(x)\ < Pr[iV > x — ea(x)] 

and from below by 

Pr^jv > x, N > x — ea(x)] > Pr[S , ^ +ea ( a; )] > x, N > x + ea(x)] 

> Pi[N > x + ea{x)\ - Pr[S lx+£a(x)] < x}. 

The second term on the right-hand side is o(Pr[iV > x]) as x — > 00 by the 
same argument as in the previous paragraph. Moreover, by (2.6), 



lim 



Pr[iV > x + ea{x)\ 



e 



-2e 



x->co p r [AT > x — ea(x)] 
As e > was arbitrary, indeed P^S^r > x] ~ Pr[A^ > x] as x — > 00. □ 

Theorem 4.2 //AT and {Xj}j>i are independent, if E[e jXl ] < 00 for some 
7 > and i/ (2.6) holds for a function a such that 

.. . , loga(x) 1 „. 
hmmf , > -, (4.3) 

x^co logo; 2 
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then Pt[Sn > x] ~ Pr[iV > x/fj] as x — > oo. 



Proof. Without loss of generality, assume /x = 1. Let A(x) := loga(x)/ logx, 
so a(x) = x A ^. By (2.7) and (4.3), 

1/2 < Ao := liminf X(x) < limsupA(a;) < 1. 

x >oo x — >co 

Let k be an integer larger than 2 such that A >l/2 + l/2 fe and decompose 
Pr[S N >x}= T x {x) + T 2 (x) + T 3 (x), (4.4) 

where 

T^x) = Pr[S N >x,N < [x-x 3 / 4 \], 

T 2 (x) = Pr[S N >x,[x- x 3 / 4 \ < N < [x — x 1 ' 2 ^' 2 ^}, 

T 3 (x) = Pr[S N >x,[x- x 1 ' 2 ^' 2 ^ < N\. 
We will show that for i = 1,2, 

Ti(x) = o(Pi[N > x]), x -> oo, (4.5) 

and, for arbitrary e > 0, 

e~ £ < lim inf T ^ - < lim sup T ^ - < e £ . (4.6) 

Since e is arbitrary, the combination of equations (4.4), (4.5) and (4.6) implies 
lim^oo Pr[SV > x]/ Pr[N > x] — 1, as required. 

The term T±{x). 

Denoting n(x) = [x — x 3 / 4 J, we have T\(x) < Pr[S n ( x ) > x] = PrfS^) > 
n(x) + {x — n(x)}]. Since x — n(x) ~ a; 3 / 4 is of smaller order than x but of 
larger order than x x l 2 as x — > oo, an application of Lemma 2.2 yields 

Ti(x) <exp^-^ 1 / 2 {l + o(l)}^, x^oo. 

On the other hand, since x 1 ! 2 = o(a(x)) as x — > oo, the representation in (2.8) 
implies exp(— 5a; 1 / 2 ) = o(Pr[iV > x]) as rr — > oo for all 5 > 0. Equation (4.5) 
for i = 1 follows. 

The term T 2 (x). 
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For integer j > 2, write tj = 1/2 + l/2 j . Clearly 3/4 = r 2 > r 3 > • • • > r k > 
1/2. Further, denote rij(x) = [x — x Tl \. We have n 2 (x) < n 3 (x) < • • • and 

T 2 (x) = Pr[S N > x,n 2 (x) < N < n k (x)\ 

= Pr i S N > x,nj(x) < N < n j+1 (x)] 

3=2 

< Y;Pr[S nj+l{x) > x] Pr[N > nj (x)}. (4.7) 

3=2 

A similar argument as for the term Ti(x) in the previous paragraph yields 

Pr[S nj+l{x) >x] = exp^-^x 2 ^- 1 ^ + o(l)}j 

= exp^-^x 1 /* {1 + o(l)}^, x - oo. 

On the other hand, since x 1//2 = o(a(x)) as x — > oo, the representation in (2.8) 
implies 

, Pr[iV > nAx)] . 1/2J . 
log i, J \ n = o(x 1/2 ), x^oo. 
6 Pr[iV > x] v h 

Combine the final two displays to derive that every term on the right-hand 

side of (4.7) is o(Pr[iV > x\) as x — > oo. 

The term T 3 (x). 

Fix e > 0. On the one hand, since a(x) = x x( - x \ the choice of k entails that 
^1/2+1/2 _ ( a ( x ^ as a; -> oo. Hence 

T 3 (x) = Pr[S N > x,N > [x — x 1/2+1/2k \] < Pr[N >x- ea(x)}, 
whence, by (2.6), T 3 (x) < {1 + o(l)}e e Pr[iV > x] as x -> oo. 
On the other hand, denoting m(x) = [x + ea(x)\ , 

T 3 (x) > Pr[S N >x,N> m(x)} 

> Pr[S m(x) >x,N> m{x)\ = {1 - Pv[S m{x) < x}} Pr[N > m{x)\. 

(4.8) 

Since m(x) = x + {e + o(l)}a(x) as x — > oo, by Lemma 2.2, 
Prf-S'm^) < x] = Pr[S m(x) < m(x) - {m(x) - x}] 

= expf-^M{l + O (l)})->0, x^oc. (4.9) 
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Moroever, by (2.6), Pr[iV > m(x)} ~ e~ E Pv[N > x] as x — > oo. This relation 
in combination with (4.8) and (4.9) yields T 3 (x) > {1 + o(l)}e~ £ Pr[iV > x] as 
rr — > oo. This finishes the proof of (4.6) and hence of the theorem. □ 



Example. Let N be a discretized Weibull variable with shape parameter 
< (3 < 1 as in the example in subsection 2.2. Then Theorem 4.1 applies for 
(3 < 1/3, while Theorem 4.2 applies as long as f3 < 1/2. 



5 Discussion 



At first sight, Theorem 4.2 does not seem to entail much of an extension 
compared to the following corollary to Theorem 3.6 in Asmussen et al. (1999). 

Corollary 5.1 If N and {Xj}j>! are independent and if eq. (2.6) holds for a 
function a such that 

(i) a(n)/n l l 2 — > oo as n — > oo, 

(ii) Pr[S n > n/i + ca(n)] = o(Pr[iV >n\) as n — > oo /or a// c > 0, 
£/ien Pr[SAr > rr] ~ Pr[A^ > x/fj] as x — > oo. 

Indeed, the only real difference of Corollary 5.1 with respect to Theorem 4.2 
seems to be the extra condition (ii). However, this condition turns out to be not 
so harmless: Although condition (i) only requires a{n) to grow to infinity at a 
faster rate than n 1//2 , the following lemma shows that condition (ii) effectively 
forces a much faster rate on a, comparable to the one imposed in Theorem 4.1. 

Lemma 5.2 Under the conditions of Corollary 5.1, ifE[e lXl ] < oo for some 
7 > and if the function a is regularly varying of index 5 < 1, then necessarily 
5 > 2/3. 



Proof. By (2.8) and by Lemma 2.2, 
PrfSVj > nfi + a(n)] 



Pr[N > n] 



0(1)1 + ri A - logc(n) ) 



(5.1) 

as n — ► oo. By Karamata's theorem (Bingham et al., 1987, Proposition 1.5.8), 



Jo 



g(t) 1 n 
a[t) l — o a[n) 
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On the one hand, the function t h- > a 2 (t)/t is regularly varying of index 25 — 1; 
on the other hand, the function t h- > t/a(t) is regularly varying of index 1 — 5. 
Hence, if the expression in (5.1) converges to zero as n — > oo, then necessarily 
25- 1 > 1 -5, whence 5 > 2/3. □ 

Finally, the lower bound 1/2 for the order of the mean excess function a in 
Theorem 4.2 seems to mark the minimal weight that must be present in the 
tail of N for the asymptotic equivalence Pr [Sn > x] ~ Pr[iV > xj fi] to be true. 
Assume for example that the distribution of Xi is unit-mean exponential and 
that N is independent of {^}i>i- For t > 0, the distribution of the random 
variable Z t = max{n = 0, 1, . . . : S n < t} is Poisson with mean t, and 

Pr[S N >t] = Pr[N > Z t ] = E[exp{-g(Z t )}}, 

where g(x) := — logPr[iV > x]. The asymptotic behavior of the final expres- 
sion in the previous display as t — > oo has been studied in Foss and Korshunov 
(2000) in the general case where Z t is the sum of iid nonnegative random 
variables; in our case, Z t is for integer t the sum of independent random vari- 
ables with common Poisson distribution and mean 1. By Foss and Korshunov 
(2000, Theorem 5.1), if iV = [Y\ and Pr[Y > y] = exp(-/) for y > where 
P E [1/2,2/3), then 

Pr[S N > x] ~ Pr[iV > x] exp((3 2 x 2l3 ~ 1 /2), x -> oo. 

In particular, PrfSV > x] is not asymptotically equivalent to Pr[iV > x]. In 
Asmussen et al. (1999), the exponent 1/2 was found to be critical as well. 



6 Applications 



Earthquake insurance. Earthquake insurance provides coverage to the 
policyholder in the event of an earthquake that causes damage to the policy- 
holder's properties. Insurance companies must be careful when assigning this 
type of insurance because, even if the individual claims are bounded by the 
value of the properties insured, the number of claims can be very large. An 
earthquake strong enough to destroy one house will probably destroy hundreds 
of houses in the same area. 

Let us assume that, given the energy of the earthquake A = A, the number 
of claims has a Poisson distribution with parameter f3\ where (3 is a posi- 
tive constant. Despite of the apparent complexity involved in the dynamics 
of earthquakes, the probability distribution of the energy of an earthquake 
follows a power law distribution known as the Gutenberg-Richter law (Guten- 
berg and Richter, 1944): Pr[A > A] = \~ a L(\) where the exponent a is an 
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universal exponent close to 1, universal in the sense that it does not depend 
on a particular geographic area, and L is a slowly varying function. 



We claim that as x — > oo, 

Pr[N > x] ~ Pr[A > x/(3] = (x/{3y a L(x/[3). 

The proof goes as follows. Without loss of generality, assume (3 = 1. Let F\ 
denote the distribution function of the Poisson distribution with mean A. Note 
that if < A < fi < oo, then 1-F X < 1-F^ = 1-F A *F M _ A . Also let Z U Z 2 ,.. ■ 
denote iid random variables with common Poisson distribution and mean 1. 
Then 



Pi[N >x]= E[Pr(iV > x | A)] = E[l - F A (x)\ 
<E[1-F w (x)} 

J2 Zi > x 



Pr 



i=l 



and similarly 



Pr[N > x] > Pr 



'LAJ 

J2 Z i > X 

i=l 



Since the tail function of A is regularly varying and since A — 1 < |_AJ < [A] < 
A + 1, we find as A — > oo 

Pr[[AJ > A] ~ Pr[A > A] ~ Pr[[A] > A]. 

The claim now follows from our Theorem 3.1 and the above upper and lower 
bounds on PrLV > x\. 

By Theorem 3.1, the distribution of the total claim amount in the portfolio has 
a regularly varying tail as well. Let us note that it is not necessary to assume 
that the individual claim amounts are independent of the claim number. In 
fact the energy of the earthquake may also have an impact on the distribution 
of the individual claim amounts. 



Hierarchical unobserved heterogeneity. Let us consider an hierarchical 
heterogeneity model for the number, N, of claims of a policyholder. First 
assume that, given A = A, the claim number has a Poisson distribution 
with parameter A. Secondly assume that, given V = v, A has an exponential 
distribution with mean v > 0. It follows that 

Pr[N = n | V = v] = — - - 1 . , 
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that is, given V — v, N follows a Geometric distribution with succes proba- 
bility parameter v/(l + v). Thirdly assume that log(l + V) follows a Gamma 
distribution with shape parameter 7 > and scale parameter c > 0. We 
deduce that 

Pr[iV >n] = E[ e - nlog(1+y) ] = ( 
1 ~ 1 1 1 \c + n 

that is, N has a Pareto distribution with index 7. 

It is well-known that the omission of an individual unobserved heterogeneity 
leads to overdispersion (in the sense that the variance is larger with hetero- 
geneity). The proposed hierarchical model shows that unobserved heterogene- 
ity can lead to grossly incorrect conclusions about the tail of the claim number 
distribution. Assume that their is no heterogeneity for the claim distributions 
and that the common distribution is light-tailed. If the heterogeneity for the 
number of claims is observed, then the distribution of the total claim amount 
is light-tailed, whereas if the heterogeneity is unobserved, then the distribution 
of the total claim amount is heavy-tailed. 




Stationary waiting time of customers. Let {Tj}j>! be a stationary 
sequence of nonnegative random variables with finite mean and set S n = 
Y^i=\Ti- m Resnick and Samorodnitsky (1997), the number of customers, N n , 
in a system seen by the nth arriving customer is defined by N = and 

N n = (JV n _! + 1 - r(S n _!, S n ))+, n > 1, 

where T is a homogeneous Poisson process with intensity u independent of 
{Ti\i>\- The waiting time, W n , of the nth arriving customer satisfies 

N n +1 

i=i 

where Xi, X2, . . . are iid exponentially distributed random variables with com- 
mon mean \i independent of N n . If {Tj}j>i is a reversible, stationary, ergodic 
process and if E[Ti] < /i, then N n converges in distribution to a random vari- 
able N, the distribution of which may, under additional conditions, have a 
regularly varying tail. Whereas in Resnick and Samorodnitsky (1997) only a 
lower bound is given for the tail of 

N+l 

1=1 

Theorems 3.1 and 3.2 give sufficient conditions such that actually 
Pr[W > x] ~ Pr[iV > fi^x], x -> 00. 
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Teletraffic arrivals. A large number of teletraffic measurements shows 
that file sizes and transmission times exhibit heavy tails and long-range de- 
pendence. Standard models for explaining these empirically observed facts are 
the so-called ON/OFF model and the infinite source Poisson model. In Fay 
et al. (2006), a model is introduced that extends these standard models in a 
simple, but realistic way. They assume that the first packet of a flow of data 
arrives at the point Tj of a Poisson process with intensity A > 0. Flow j then 
consists of Kj packets, the kth of which arrives at time Yj k = Tj + Sjk, where 

k 

Sjk = J2 X H> 0<k<Kj. 
1=1 

In Fay et al. (2006), it is assumed that the variables form an array of 
iid nonnegative random variables and the Kj are iid integer-valued random 
variables independent of the X^. Of interest is the tail behavior of the total 
transmission time Sjk under the assumption that the tail of X^ or Kj is 
regularly varying. Proposition 4.3 of Fay et al. (2006) gives results similar as 
those in Theorem 3.2 above. In Fay et al. (2006, Proposition 4.9), the reverse 
problem is considered as well: if the tail of SjKj is regularly varying with index 
—a for some a > and if the tail of Kj is heavier than that of Xji, then what 
can be said about Kj! For instance, if the tail of SjKj is regularly varying with 
index —a ^ —1 and if Pr[Xx > x] = o(Pt[N > x]) as x — > oo, then the tail of 
Kj must be regularly varying with index —a as well. 
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